Disturbance Decoupling for Gradient-Based Multi-Agent Learning With Quadratic Costs

نویسندگان

چکیده

Motivated by applications of multi-agent learning in noisy environments, this letter studies the robustness gradient-based dynamics with respect to disturbances. While disturbances injected along a coordinate corresponding any individual player's actions can always affect overall dynamics, subset players be disturbance decoupled-i.e., such players' are completely unaffected disturbance. We provide necessary and sufficient conditions guarantee property for games quadratic cost functions, which encompass one-shot continuous games, finite-horizon linear (LQ) dynamic bilinear games. Specifically, decoupling is characterized both algebraic graph-theoretic on latter obtained constructing game graph based gradients costs. For LQ we show that imposes constraints controllable unobservable subspaces players. two player within action coordinates payoff matrices. Illustrative numerical examples provided.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-agent Learning and the Reinforcement Gradient

The number of proposed reinforcement learning algorithms appears to be ever-growing. This article tackles the diversification by showing a persistent principle in several independent reinforcement learning algorithms that have been applied to multi-agent settings. While their learning structure may look very diverse, algorithms such as Gradient Ascent, Cross learning, variations of Q-learning a...

متن کامل

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...

متن کامل

Improving Agent Performance for Multi-Resource Negotiation Using Learning Automata and Case-Based Reasoning

In electronic commerce markets, agents often should acquire multiple resources to fulfil a high-level task. In order to attain such resources they need to compete with each other. In multi-agent environments, in which competition is involved, negotiation would be an interaction between agents in order to reach an agreement on resource allocation and to be coordinated with each other. In recent ...

متن کامل

Robust Reinforcement Learning Decoupling Control Based on Integral Quadratic Constraints

Abstract. In order to keep stable in reinforcement learning process, a novel robust reinforcement learning decoupling control (RRLDC) based on integral quadratic constraints(IQC)is presented in this paper. It composes of a linear model to approximate the nonlinear plant, a state feedback K controller to generate the basic control law, and an adaptive critic unit to evaluate decoupling performan...

متن کامل

Disturbance Decoupling Problems with Quadratic Stability for Switched Linear Systems via State Feedback

In this paper disturbance decoupling problems without stability and with quadratic stability for switched linear systems are formulated in the framework of the so-called geometric approach. Firstly, necessary and sufficient conditions for the problem without stability to be solvable are given. Secondly, sufficient conditions for the problem with quadratic stability to be solvable are given. Fur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Control Systems Letters

سال: 2021

ISSN: ['2475-1456']

DOI: https://doi.org/10.1109/lcsys.2020.3001240